Prom cellular properties to population asymptotics 
in the Population Balance Equation 
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Proliferating cell populations at steady state growth often exhibit broad protein distributions with 
exponential tails. The sources of this variation and its universality are of much theoretical interest. 
Here we address the problem by asymptotic analysis of the Population Balance Equation. We show 
that the steady state distribution tail is determined by a combination of protein production and cell 
division and is insensitive to other model details. Under general conditions this tail is exponential 
with a dependence on parameters consistent with experiment. We discuss the conditions for this 
effect to be dominant over other sources of variation and the relation to experiments. 

PACS numbers: 87.10.-e, 87.15. A-, 87.17.Ee, 87.23.Cc 
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Biological cell populations are diverse in their phys- 
iological properties, even if genetically identical. Since 
physiology rather than genetics ultimately carries biolog- 
ical function, there is much interest in understanding this 
aspect of biological variation. A good model system for 
this problem is a microorganism population that is genet- 
ically uniform and grows under uniform conditions; these 
systems have been studied for many years, and have re- 
cently received renewed attention following developments 
in experiment design and technique of single-cell mea- 
surements (reviewed by [H, 0]). Experiments using fluo- 
rescence tagging combined with microscopy and cytom- 
etry have focused on variation in particular proteins in- 
side cells, while theoretical studies have provided models 
of specific circuits and noise sources. Under steady state 
growth conditions, several experiments have shown that 
even for regulated proteins, distribution shapes are insen- 
sitive to many details and are often observed to be broad 
with exponential tails This calls for a more physical 

perspective of the problem, raising questions such as the 
universality of the resulting distributions. We here show 
that an exponential tailed distribution with the correct 
dependence on system parameters follows from a descrip- 
tion involving a balance between deterministic protein 
production and dilution at cell division if these processes 
satisfy reasonable conditions. Such tails, reflecting varia- 
tion in division time, are thus expected even if stochastic 
fluctuations in gene expression are negligible. The con- 
ditions for this effect to be dominant relative to noise in 
protein production are discussed. 



A general theoretical framework for describing popu- 
lation distributions of quantities that obey a balance of 
growth and division, such as cell size or protein content, 
is the Population Balance Equations (PBE) @, In 
its most general form it can incorporate many details 
and multiple internal cellular properties. We here focus 
on the case where the relevant physiological property of 



each cell can be described by a single variable x @, [1] : 
^/(*, x) + ^ [g(x)f(t, x)] = -b{x)f(t, x) (1) 

f (i) f (*' p) dp ~ x) l°° bmt > 

Here f(x) is the probability density for the quantity x in 
the population, and g{x) is the individual growth rate of 
x. Cell division is assumed to follow a "sloppy control" 
mechanism |9| : b(x) is the probability per unit time for a 
cell of quantity x to divide. Once division occurs, d(p) is 
the probability for dividing into two daughter cells with 
fractions p and 1— p of the mother cell. To obey mass 
conservation d(p) = d(l — p). The last term in the equa- 
tion accounts for normalization. Underlying this model 
is the assumption that the growth process occurs gradu- 
ally and with small fluctuations throughout the cell cycle, 
whereas division abruptly induces a large change in x. 

A large body of previous work on this model is dedi- 
cated to theorems regarding the existence and uniqueness 
of solutions [hJ, numerical algorith ms ( [llT] and refer- 
ences there) and special case solutions [12| ■ Traditionally 
the coordinate x was interpreted as related to cell size 
(mass, linear dimension etc.), and the dependence of the 
probability per unit time to divide on cell size reflects the 
combination of deterministic size-dependent and random 
aspects of cell division Q. However, for our purpose of 
analyzing the asymptotic properties of the steady-state 
distributions, x can also be interpreted as the amount of 
a particular protein or molecule in the cell, any quantity 
which is produced and preserved at cell division. This 
follows because the probability per unit time to divide 
generally saturates for large values of cell size, age, or 
protein content, reflectin g th e inherent probabilistic com- 
ponent of the cell cycle [13|. This point, as well as the 
effect of an additional stochastic component in g(x), will 
be further discussed below. 

Our analysis begins by considering the steady state 
solution of Eq. fT]). Assuming such a solution exists, 
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FIG. 1: Steady state population distributions with exponen- 
tial tails. Numerical solution for constant growth g(x)= , y 
and saturating probability of division per unit time. (□,(>): 
b(x) = boH(x — 9); d(p) sum of two Gaussians at p = 0.3 and 
p = 0.7. This function describes asymmetric division, such 
as that observed for budding yeast cells. (V,0) : b(x) — 
& [tanh(fc(x - $)) + 1] with k = 5, 9 = 1 and d{p) = 1. 
Asymptotic approximations (Eq. ([8])) are shown by solid lines. 



f(t,x)=f(x) and the last integral becomes a constant, 
J °° b(x)f(t,x)dx = R. This constant is the specific 
growth rate of the number of cells in balanced exponen- 
tial growth, and can be viewed as a parameter in the 
equation. Therefore at steady state, 



— [g(x)f(x)} = -b(x)f(x) 
+ 2j o ld M b (^f(^d P -Rf(x). 



(2) 



Now consider the incoming flow contributing to the prob- 
ability density at large x, where f{x) is a decreasing func- 
tion. It comes from two processes: growth, bringing cells 
of low i to a higher one; and division, breaking high- a; 
cells into pairs of smaller x. If the probability density de- 
creases rapidly enough, then for large x the first of these 
incoming flows is dominant over the second. We shall 
assume that this is the case for now, neglect the integral 
term representing the second flow in Eq. (|2|), and return 
to examine the consistency of this assumption later. One 
then obtains the following ordinary differential equation: 



dx 

with the solution: 



[g{x)f(x)} = -(b(x) + R)f(x) 



f(x) = Cexp(-J 



b(Q + R + g'(S) 
9(0 



(3) 



(4) 



A related integral was found for the case of exactly sym- 
metric division and a finite ranged variable Q. Here we 
argue that in general under the assumption of a rapidly 
decreasing f(x) the ratio between two points at the tail 
of the distribution is given by Eq. (T5)) with the limits of 
integration at the two points. 



If x represents cell size, g(x) is the growth function of 
the individual cell. Experiments directly measuring this 
function are not straightforward 14]; theoretical works 
have mostly assumed either linear or constant functions 
for simplicity. If x is interpreted as the amount of a pro- 
tein, then a constant g represents a mean rate of protein 
production that is independent of the protein level. As- 
suming g(x) = 7 and a saturating probability per unit 
time to divide b{x) — >&o for large x, 



K=(b +R)fy. 



(5) 



Returning now to the question of the validity of the naive 
approximation Eq. a resulting exponential tail hints 
to consistency of the approximation since the function 
decreases rapidly. More precisely, we assumed that 



--{g( X )f( X )]. (6) 



Substituting the above exponential one finds that this 
requirement is satisfied by x >> d max p/ k 2 , where d max — 
max p {d(p)} and p = 260/7; this defines the regions of 
consistency of the approximation. 

The Population Balance Equation Eq. (fT]) can be 
solved numerically ([ll[ and references there). We have 
developed a numerical procedure to solve the time- 
dependent equation on a semi-infinite range based on the 
method of time-evolution operators 0] • Fig. Q] shows the 
steady state solution with functions g, b that saturate at 
large x. As predicted by the argument above, the distri- 
butions exhibit exponential tails. Starting the dynamics 
from various initial conditions always relaxed to the same 
steady state distribution. An exponential tail was found 
for all division functions d(p), consistent with Eq. (JSJ). 

Using this observation, we proceed without much loss 
of generality to a more accurate asymptotic approxima- 
tion for the case d(p)=l. Assuming once again g(x)= r y 
and b(x)—>bo for large x, Eq. ^ is equivalent, by a change 
of variables and an additional differentiation, to 



~j 3 P f\) = °- 

dx z dx x 



(7) 



x = 00 is an irregular singular point of this equation [TtT ] . 
Trying a solution f(x) = exp(/ix)x x ri(x) with \£lZ and 
f](x) analytical at x = 00, we obtain to leading order: 



f(x)x- 



C x x 



+ C 2 x 



-p/t, 



(8) 



Since Eq. ([7]) is of second order we have two indepen- 
dent solutions; however, as < R < bo it follows that 
1 < p/k < 2 and hence the mean of the second solution 
diverges. This observation, while obviously not a proof 
of uniqueness, supports the numerical result of relaxation 
to a unique steady state distribution from many initial 
conditions. 

An exactly solvable case occurs when b(x) = bo, then 
f(x) = K 2 xe~ KX . Here R = bo, then k — p so the first 
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FIG. 2: Steady state distributions for g(x) = yx, b(x) — box 
and d(p) — 1. This example shows that the distribution tail 
is determined by the ratio between b(x) to g(x) and not by 
each function separately. 
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FIG. 3: Steady state distributions for g(x) — yx, b(x) = 
^ [tanh(fc(z - 0)) + 1] with k = 2, 6 = 2 and d(p) = 1. Nu- 
merical solutions (symbols) are shown with asymptotic ap- 
proximation to the tail (lines). 



asymptotic function in Eq. ([5]) is an exact solution; the 
second, f(x)^x~ 1 , is non-normalizable. The PBE here 
reduces to a model studied in where protein is pro- 
duced at a constant rate and cells divide with constant 
probability per unit time. 

We thus establish that under general conditions the 
steady state distribution exhibits an exponential tail, as 
has been observed in several experiments d i). The 
exponential tail is obtained neglecting variation in the 
source g, and stems from a balance between the first- 
order kinetics of cell division and a constant or saturating 
deterministic source. The dependence of the exponent 
on parameters is such that upon increase of production, 
represented by g, the exponential tail broadens. This is 
consistent with experimental observations on protein pro- 
duction at steady state in populations of yeast cells [3] , 
and inconsistent with most models that account for pop- 
ulation variation by production noise. 

Formally Eq. J3J indicates that the distribution tail 
is determined by the ratio of the growth and division 
functions, not by each of them separately. Thus, if for 
large x these functions do not saturate but have the same 
x-dependence, an exponential tail will also arise. Fig. 
2 shows the numerical solution for linearly increasing 
g{x), b(x), supporting this prediction. While not imme- 
diately relevant to protein production, this result illus- 
trates how exponential tails can arise by different growth 
and division functions maintaining constant ratio. It thus 
supports our analytic conclusion about how the combi- 
nation of these functions shapes the distribution tails. 

A growth, or production, function g(x) that increases 
with x is relevant for several biological contexts. For 
example, if food uptake is related to the surface area of 
the organism and x is a linear dimension, then growth 
is an increasing function of x Q- For g(x) — yx one can 
show that R = y and therefore k — p/2 + 1. Using the 
same procedure as before to write an equivalent ordinary 
differential equation for b(x) saturating to bo at large x 



and d(p) = 1, we find 

^ + [p/2 + 3] if + P ^f = 0. (9) 

aar x ax x z 

This is the Euler equation [13] with power-law solutions 
f(x) = Cx a where a = —p/2, —2. Of the two indepen- 
dent solutions to the asymptotic equation, only a= —p/2 
with p> 4 is consistent with f(x) being a probability den- 
sity with a finite mean. Indeed, numerical simulations in 
this parameter regime always relax to a steady state with 
a tail f(x) ~ x~ b °/ 1 ; see Fig.|3]for a comparison between 
the numerical solution and the asymptotic tail. 

The special case of b(x) = boH(x — 9), where H is 
the Heaviside function with threshold 9, is exactly solv- 
able. Here the Euler equation Eq. ((9]) holds exactly in the 
region x > 9. By continuity and normalization require- 
ments one can show that the coefficient of the solution 
with a = —2 is exactly zero, and the unique solution is 

f (1 - 2/p) ■ i x < 9 

m = { (io) 

[ (1 - 2/p) • Qp/^x-p/z x> 

Once again, this solution is valid for p > 4 (po > 2y). 
Note that the naive argument leading to Eq. ((4J is self- 
consistent in this case only for a more severely limited 
region of parameters (bo >> 3y). 

In summary, we used the population balance equation 
(PBE) to study the interplay between intracellular and 
population processes in shaping the steady state distribu- 
tion in a dividing cell population. The novel component 
in our approach is to consider the variable x describing 
the cell state as unbounded and to focus on the asymp- 
totic properties of its distribution. This enables us to 
extend the interpretation of particular protein or 

molecule in the cell, since asymptotically the probability 
per unit time to divide becomes independent of the vari- 
able, b(x) — ► bo for large a;. This probabilistic component 
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of the cell cycle is a well established property for many dividing organism exhibits similar exponential tails. 

Taken together, our results suggest that exponential 
tails in the distribution of an abundant protein in a di- 
viding population may be a much more universal feature 
than previously thought, since they reflect fundamental 
properties of randomness in cell division times and not 
necessarily the particular microscopic details of protein 
production circuits. 
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cell types 

We have shown that generally the functional forms of 
mean growth or production g(x) and probability per unit 
time to divide b(x) determine the tail of the distribution 
through a particular combination, Eq. ([?]). Because the 
PBE takes into account the kinetics of cell division as a 
discrete process, randomness in the timing of cell division 
is sufficient to yield an exponentially tailed distribution 
at steady state. In reality, the single-cell function g(x) 
itself has a stochastic component, and this can be added 
to the model using the diffusion approximation. Such an 

Sg 2 



extension will be a good approximation if jAp? *C i, a +R • 
At the other extreme, if internal stochasticity is dom- 
inant, it should be modeled in detail. For example, pre- 
vious work has shown that bursts in mRNA production 
cause an exponential distribution of protein produced in 
each cell, which in turn is reflected as exponential tails 
in the population distribution lj|, 2(J 2l[. Division can 
then be assumed synchronous with symmetric binomial 
distribution [ijj [2Q], or it can be altogether neglected 
and described as a continuous dissipative process [2l| . 
without changing the result. The validity of each regime 
depends on the relative variation of the two processes, 
production and division, and on their relative time scales. 
One way to identify the regime in experiment is the de- 
pendence of the exponential tail on parameters: if the tail 
results from microscopic effects, then a larger mean pro- 
duction results in relatively narrower distributions and 
the slope of the tail remains intact. However, if the ex- 
ponent results from a combination of sloppy division and 
deterministic production as suggested here, then larger 
mean production results in a broader exponential tail. 
Experiments on yeast populations have shown that in- 
creasing the mean protein production, either by an in- 
crease in the number of promoters or by adding inducing 
agents, increases the mean and at the same time broadens 
the exponential tail [3] . This dependence suggests that it 
is the population effects, rather than microscopic noise, 
which govern the distribution tails in these experiments. 

In any interpretation of x, our results predict that the 
distribution tails will be insensitive to the division func- 
tion d(p). This is supported by the universality of pro- 
tein distribution tails inyeast cells grown under various 
steady state conditions . Yeast cells divide asymmetri- 
cally, with the degree of asymmetry depending on growth 
rate and environment Q. The observation that under 
all growth conditions the protein distribution exhibited 
exponential tails is consistent with our prediction. More- 
over, unpublished results on bacteria populations grown 
at steady state [22| show that even this symmetrically 
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